Data Munging

  • Data Munging

    • Missing values
    • Data Enrichments
    • Data Melting
    • Clean unicode
    • Feature Engineering and Variables Transformations
  • Model builtding

    • Data encoding
    • Feature scaling
    • Feature Selections





Missing Values

Variable Transformations and Feature engineering

Regular expressions: misspellings, regular expression patterns e https://www.analyticsvidhya.com/blog/2016/01/guide-data-exploration/

Data enrichment






In [ ]:
https://github.com/NathanEpstein/Dora
    Feature Selection & Extraction

In [ ]:


In [ ]:
https://github.com/LuminosoInsight/python-ftfy
        Given Unicode text, make its representation consistent and possibly less broken.

In [ ]:


In [ ]:
http://nbviewer.jupyter.org/github/rasbt/pattern_classification/blob/master/preprocessing/about_standardization_normalization.ipynb
About Feature Scaling: Z-Score Standardization and Min-Max Scaling

In [ ]:


In [ ]:
https://github.com/NathanEpstein/Dora
data versioning